DETAILED ACTION

1. This office action is in response to correspondence filed August 16, 2007 in
reference to application 10/783,518. Claims 1-33 are pending in the application and 
have been examined. 

Response to Amendment 

2. The amendments filed August 16, 2007 have been accepted and considered in
this office action. Claims 1, 4, 5, 30, and 33 have been amended.

Response to Arguments 

3. Applicant's arguments filed August 16, 2007 have been fully considered, but they
are not persuasive.

4. With respect to applicant's arguments, see page 11 of remarks, that claims 30-32
should not be rejected under 35 U.S.C. 101 as a computer readable medium would not
be interpreted as carrier waves, the examiner respectfully disagrees. A computer
readable medium, unless defined in the specification to be a storage medium, might be a
carrier wave or electromagnetic or even an optical signal. All of these are non-statutory
under 35 U.S.C. 101.

5. With respect to applicant's arguments, see pages 11-14, that Chen and Baker do
not teach that each record includes histories for each of two competing partial
hypotheses, the examiner respectfully disagrees. The specification of the application
does not clearly define what a history is, and it is therefore open for interpretation. In 
the previous rejection, history was interpreted as the part of the word which had already 
been determined by the system of Baker. As described in column 13, for instance, when 
Baker detects a current word having letter string "ORSE", choice list generator matches 
this with other words such as morse, horse, etc. Here each of these matches are choice 
records and the history of the record is "orse." With this interpretation, it is clear that
Baker does in fact teach histories associated with records. Although they may not be 
called histories, that is what in effect they are. 
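For illustration only, the interpretation above can be sketched in Python as matching a partially determined letter string against a vocabulary to produce choice records. The function name `choice_words`, the suffix-matching rule, and the sample vocabulary are assumptions made for this sketch; they are not taken from Baker.

```python
def choice_words(known_letters, vocabulary):
    """Return vocabulary words that end with the letters determined so far.

    Suffix matching is an assumed stand-in for whatever matching rule the
    reference actually applies.
    """
    suffix = known_letters.lower()
    return [word for word in vocabulary if word.lower().endswith(suffix)]


# Hypothetical vocabulary; "house" is included only to show a non-match.
vocabulary = ["gorse", "horse", "morse", "norse", "worse", "house"]
matches = choice_words("ORSE", vocabulary)
print(matches)
```

Under this reading, every returned word is a choice record and the shared letter string "orse" plays the role of its history.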

Claim Rejections - 35 USC § 101 

6. 35 U.S.C. 101 reads as follows:

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

7. Claims 30-33 are rejected under 35 U.S.C. 101 because the claimed invention is
directed to non-statutory subject matter. Claims 30 and 33 attempt to claim a
computer readable medium. However, this may be interpreted as merely a carrier wave,
which is considered non-statutory subject matter under 35 U.S.C. 101. Claims 31 and
32 are also rejected for the same reasons, as they are dependent on claim 30.

Claim Rejections - 35 USC § 103

8. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all
obviousness rejections set forth in this Office action:

(a) A patent may not be obtained though the invention is not identically disclosed or described as set
forth in section 102 of this title, if the differences between the subject matter sought to be patented and
the prior art are such that the subject matter as a whole would have been obvious at the time the
invention was made to a person having ordinary skill in the art to which said subject matter pertains.
Patentability shall not be negatived by the manner in which the invention was made. 

9. The factual inquiries set forth in Graham v. John Deere Co., 383 U.S. 1, 148
USPQ 459 (1966), that are applied for establishing a background for determining
obviousness under 35 U.S.C. 103(a) are summarized as follows:

1. Determining the scope and contents of the prior art.

2. Ascertaining the differences between the prior art and the claims at issue. 

3. Resolving the level of ordinary skill in the pertinent art. 

4. Considering objective evidence present in the application indicating 
obviousness or nonobviousness. 

10. Claims 1-6, 13-17, 26, 28, and 33 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Chen et al (US Patent 6,760,720) in view of Baker et al (US Patent 
5,680,511). 

11. Consider claim 1, Chen teaches a method of constructing a choice list of
alternate versions of a recognized transcript from a speech recognition system (method 
for generating candidate word strings, abstract), said method comprising: 

generating an alternative version of the transcript (Then, the node sets with relative
high string scores are selected to connect the nodes by their starting time frame and
ending time frame, thereby generating the candidate word strings, abstract.); and,

adding the alternative version of the transcript to the choice list (Then, the node
sets with relative high string scores are selected to connect the nodes by their starting 
time frame and ending time frame, thereby generating the candidate word strings, 
abstract. Multiple strings are generated, see table 1.). 

But Chen does not specifically teach: 

during speech recognition, generating a list of close call records, wherein each 
record includes histories for each of two competing partial hypotheses; 

initializing the choice list from at least one output of the speech recognition 
system; 

selecting one of the close call records from the list of close call records; 
selecting a transcript from the choice list; 

determining whether one of the two histories for the selected record matches a 
partial subhistory of the transcript from the choice list; 

if one of the two histories for the selected close call record matches a partial 
subhistory of the transcript, substituting the other of the two histories for the partial 
subhistory of the transcript to generate an alternative string candidate. 
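For illustration only, the steps recited above can be sketched in Python as follows. The representation of a history as a tuple of words, the reading of "partial subhistory" as a contiguous run of words, and the names `find_sub` and `expand_choice_list` are assumptions of this sketch, not the applicant's or either reference's implementation.

```python
from typing import List, Tuple

# Assumed representation: a history is a tuple of words, and a close call
# record pairs the histories of two competing partial hypotheses.
History = Tuple[str, ...]
CloseCall = Tuple[History, History]


def find_sub(transcript: History, sub: History) -> int:
    """Return the index where `sub` occurs contiguously in `transcript`, or -1."""
    n = len(sub)
    if n == 0:
        return -1
    for i in range(len(transcript) - n + 1):
        if transcript[i:i + n] == sub:
            return i
    return -1


def expand_choice_list(recognized: History, records: List[CloseCall]) -> List[History]:
    """Build a choice list by swapping competing histories into transcripts."""
    choices = [recognized]  # initialize from the recognizer output
    for first, second in records:  # select a close call record
        for transcript in list(choices):  # select a transcript (snapshot copy)
            for match, other in ((first, second), (second, first)):
                i = find_sub(transcript, match)
                if i >= 0:
                    # Substitute the other history for the matched subhistory.
                    alt = transcript[:i] + other + transcript[i + len(match):]
                    if alt not in choices:
                        choices.append(alt)
    return choices


# Hypothetical data for demonstration.
recognized = ("the", "horse", "ran")
records = [(("the", "horse"), ("the", "norse")), (("ran",), ("rang",))]
alternates = expand_choice_list(recognized, records)
print(alternates)
```

Iterating over a snapshot of the choice list means each record is compared against every transcript already present, including alternates produced by earlier records.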

In the same field of word-by-word speech recognition, Baker teaches: 

during speech recognition, generating a list of close call records, wherein each 
record includes histories for each of two competing partial hypotheses (The choice list 
generator 14 couples to the data string memory 12 and generates a plurality of choice 
words 20. The system 10 offers the list of choice words 20, as possible substitutes for 
the current word 38 being analyzed by the system 10; column 9 line 10. The
specification of the application does not clearly define what a history is, and it is
therefore open for interpretation. In the previous rejection, history was interpreted as 
the part of the word which had already been determined by the system of Baker. As 
described in column 13, for instance, when Baker detects a current word having letter
string "ORSE", choice list generator matches this with other words such as morse, 
horse, etc. Here each of these matches are choice records and the history of the record 
is "orse." With this interpretation, it is clear that Baker does in fact teach histories 
associated with records.); 

initializing the close call list from at least one output of the speech recognition 
system (The choice list generator 14 couples to the data string memory 12 and 
generates a plurality of choice words 20. The system 10 offers the list of choice words 
20, as possible substitutes for the current word 38 being analyzed by the system 10; 
column 9 line 10. The list must be initialized when considering a word in the string.);

selecting one of the close call records from the list of close call records (A record 
from the choice word list of Baker must inherently be selected in order to substitute.); 

selecting a transcript from the choice list (an utterance must inherently be 
selected in order to replace words in it); 

determining whether one of the two histories for the selected record matches a 
partial subhistory of the transcript from the choice list (The current word 38 being 
recognized can be transferred via the bus interface 48 to the processing unit 48. The 
processing unit 48 can, in one example, analyze the known information about the
current word 38 to select choice words 20A-20E from the vocabulary memory 50;
column 13, line 18.); 

if one of the two histories for the selected close call record matches a partial 
subhistory of the transcript, substituting the other of the two histories for the partial 
subhistory of the transcript to generate an alternative string candidate (Alternatively, the 
word recognition system 10 can select the choice word 20 with the highest probability 
signal 28 or rank signal 30 to substitute for the current word 38 currently being 
recognized; column 17, line 24.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to incorporate the word substitutions of Baker with the String 
candidate generation method of Chen in order to allow for substitutions of words that 
may have been misinterpreted in the transcription process. 

12. Consider claim 2, Baker teaches the method of claim 1, further including
generating a list of close call records, wherein each record includes histories for each of 
two competing word-ending partial hypotheses (The choice list generator 14 couples to 
the data string memory 12 and generates a plurality of choice words 20. The system 10 
offers the list of choice words 20, as possible substitutes for the current word 38 being 
analyzed by the system 10. The choice list generator 14 generates for each choice 
word 20 an associated probability signal 32 that indicates the likelihood that the choice 
word 20 represents the current word 38 being recognized; column 9 line 10.).

13. Consider claim 3, Chen teaches the method of claim 2, further including
generating a list of close call records, wherein each record includes histories for each of
two competing word-ending partial hypotheses, both seeding a common word (Figure 1
shows nodes n2 n6 and n9 all seeding to either n3 or n7. Nodes represent words in
this figure.).

14. Consider claim 4, Chen teaches the method of claim 1, further including
initializing the choice list with the recognized transcript (method for generating candidate 
word strings, abstract. In order to generate strings, they must be stored in a list and it is 
inherent that the list would be initialized.). 

15. Consider claim 5, Chen teaches the method of claim 1, further including
initializing the choice list with all active, legal word ending hypotheses (Figure 1 shows
two separate ending nodes being considered for the node lattice. Therefore multiple
word-ending hypotheses are considered.).

16. Consider claim 6, the method of claim 1, further including comparing the close
call record selected from the close call list against each transcript in the choice list (It
would have been obvious to one of ordinary skill in the art to repeat this technique for
every transcription on the list created by Chen.).

17. Consider claim 13, Chen teaches a method of constructing a list of alternate
versions of a recognized transcript (method for generating candidate word strings, 
abstract), said method comprising: 

adding the recognized transcript to a choice list (method for generating candidate 
word strings, abstract. In order to generate strings, they must be stored in a list and it is 
inherent that the list would be initialized with a recognized transcript.); 

for each entry on the choice list (It would have been obvious to one of ordinary 
skill in the art to repeat this technique of Baker for every transcription on the list created 
by Chen.), 

generating an alternative version of the transcript (Then, the node sets 
with relative high string scores are selected to connect the nodes by their starting 
time frame and ending time frame, thereby generating the candidate word 
strings, abstract.); and, 

(c) adding the alternative version of the transcript to the choice list (Then, 
the node sets with relative high string scores are selected to connect the nodes 
by their starting time frame and ending time frame, thereby generating the 
candidate word strings, abstract. Multiple strings are generated, see table 1.) 
and 

two partial hypotheses that seed a common word (Figure 1 shows nodes n2 n6
and n9 all seeding to either n3 or n7. Nodes represent words in this figure.). 
Chen does not teach: 



Application/Control Number: 10/783,518 Page 10

Art Unit: 2626 

during speech recognition, generating a list of close call records, wherein each 
record includes histories for each of two competing partial hypotheses;
selecting a record from the close call list; 

(a) determining whether one of the two histories for the selected record 
matches a partial subhistory of that entry on the choice list; 

(b) if one of the two histories for the selected record matches a partial 
subhistory of that entry, substituting the other of the two histories for the partial
subhistory of that entry to generate an alternative version of the transcript. 

In the same field of word-by-word recognition, Baker teaches: 
during speech recognition, generating a list of close call records, wherein each 
record includes histories for each of two competing partial hypotheses (The choice list generator 14
couples to the data string memory 12 and generates a plurality of choice words 20. The 
system 10 offers the list of choice words 20, as possible substitutes for the current word 
38 being analyzed by the system 10; column 9 line 10; Baker. The specification of the 
application does not clearly define what a history is, and it is therefore open for 
interpretation. In the previous rejection, history was interpreted as the part of the word 
which had already been determined by the system of Baker. As described in column 13, 
for instance, when Baker detects a current word having letter string "ORSE", choice list 
generator matches this with other words such as morse, horse, etc. Here each of these 
matches are choice records and the history of the record is "orse." With this 
interpretation, it is clear that Baker does in fact teach histories associated with records.);

selecting a record from the close call list (a word must inherently be selected in
order to process and replace it); 

(a) determining whether one of the two histories for the selected record 
matches a partial subhistory of that entry on the choice list (The current word 38 
being recognized can be transferred via the bus interface 48 to the processing 
unit 48. The processing unit 48 can, in one example, analyze the known 
information about the current word 38 to select choice words 20A-20E from the
vocabulary memory 50; column 13, line 18.); 

(b) if one of the two histories for the selected record matches a partial 
subhistory of that entry, substituting the other of the two histories for the partial 
subhistory of that entry to generate an alternative version of the transcript 
(Alternatively, the word recognition system 10 can select the choice word 20 with 
the highest probability signal 28 or rank signal 30 to substitute for the current 
word 38 currently being recognized; column 17, line 24.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to incorporate the word substitutions of Baker with the String 
candidate generation method of Chen in order to allow for substitutions of words that 
may have been misinterpreted in the transcription process.

18. Consider claim 14, Baker teaches the method of claim 13, further comprising: 
selecting another record from the close call list; and, for each entry on the choice list, 
repeating steps (a) (b) and (c). (Figure 1, Baker shows multiple choice list entries 22,
that are considered against the words in the selected sentence. Therefore multiple
iterations of consideration must be done.).

19. Consider claim 15, Chen teaches a method of constructing a list of alternate 
transcripts from a recognized transcript (method for generating candidate word strings, 
abstract), comprising: 

performing speech recognition on a spoken transcript to generate a best scoring
hypothesis (Then, the node sets with relative high string scores are selected to connect 
the nodes by their starting time frame and ending time frame, thereby generating the 
candidate word strings, abstract.), wherein performing speech recognition involves at 
each of a plurality of different times throughout the transcript generating two partial 
hypotheses each seeding a common word (Figure 1, N2 and N6 are two different
hypotheses that both seed word node N3), said two partial hypotheses including a primary
hypothesis having a first score and corresponding to a primary partial history (N2 has a 
high score of -164, and would be chosen first; column 3, line 50.) and a competing 
hypothesis having a second score and corresponding to a competing partial history (N6 
has a lower score of -170.); 

at each of the plurality of different times, storing a close call record, wherein said 
close call record includes the primary partial history, the competing partial history, and a 
measure of how close the two competing hypotheses are (Figure 1, this lattice is stored,
and it shows words leading to N2 and N6 and also the associated scores, which are a 
measure of how close they are.); 
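For illustration only, a close call record of the kind recited above might be modeled in Python as below. The class name `CloseCallRecord`, its field names, the use of a score difference as the closeness measure, and the node labels standing in for word histories are assumptions of this sketch; only the scores -164 and -170 come from the cited passage of Chen.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class CloseCallRecord:
    """Assumed container for two competing partial histories and their scores."""
    primary_history: Tuple[str, ...]
    competing_history: Tuple[str, ...]
    primary_score: float
    competing_score: float

    @property
    def margin(self) -> float:
        """Closeness measure: a smaller margin means a closer call (assumed definition)."""
        return self.primary_score - self.competing_score


# Node labels are illustrative placeholders for the words leading to N2 and N6.
record = CloseCallRecord(
    primary_history=("N1", "N2"),
    competing_history=("N5", "N6"),
    primary_score=-164.0,
    competing_score=-170.0,
)
print(record.margin)  # 6.0
```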

Chen does not specifically teach:

after performing speech recognition, using the stored close call records to
generate a choice list of alternative versions of the best scoring hypothesis 

In the same field of generating alternatives, Baker teaches after performing 
speech recognition, using the stored close call records to generate a choice list of 
alternative versions of the best scoring hypothesis (The choice list generator 14 couples 
to the data string memory 12 and generates a plurality of choice words 20. The system 
10 offers the list of choice words 20, as possible substitutes for the current word 38 
being analyzed by the system 10; column 9 line 10. Alternatively, the word recognition 
system 10 can select the choice word 20 with the highest probability signal 28 or rank 
signal 30 to substitute for the current word 38 currently being recognized; column 17, 
line 24.) 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the substitutions of Baker with the candidate generation of
Chen in order to allow for the list to be expanded to allow for words that may have been
mistranslated. 

20. Consider claim 16, Chen teaches a method of constructing a list of alternate
transcripts from a recognized utterance (method for generating candidate word strings,
abstract), comprising: 

storing the one or more alternate transcripts in a choice list (Then, the node sets 
with relative high string scores are selected to connect the nodes by their starting time
frame and ending time frame, thereby generating the candidate word strings, abstract.
Multiple strings are generated, see table 1.);

and using partial hypotheses seeding a common word (Figure 1 shows nodes n2
n6 and n9 all seeding to either n3 or n7. Nodes represent words in this figure.)

But Chen does not specifically teach: 

generating a list of close call records, wherein each record includes history 
information and scoring information associated with a particular pair of partial 
hypotheses; 

generating one or more alternate transcripts from the list of close call records by 
evaluating each record in the list for a match between a partial sub-history of the 
recognized utterance and one of the histories stored in the record, and upon finding 
such a match, substituting the other of the histories stored in the record for the partial 
sub-history in the recognized transcript. 

In the same field of word-by-word recognition, Baker teaches:

generating a list of close call records, wherein each record includes history 
information and scoring information associated with a particular pair of partial 
hypotheses (The choice list generator 14 couples to the data string memory 12 and 
generates a plurality of choice words 20. The system 10 offers the list of choice words 
20, as possible substitutes for the current word 38 being analyzed by the system 10; 
column 9 line 10; Baker.); 

generating one or more alternate transcripts from the list of close call records by 
evaluating each record in the list for a match between a partial sub-history of the
recognized utterance and one of the histories stored in the record, and upon finding
such a match, substituting the other of the histories stored in the record for the partial 
sub-history in the recognized transcript (The current word 38 being recognized can be 
transferred via the bus interface 48 to the processing unit 48. The processing unit 48 
can, in one example, analyze the known information about the current word 38 to select 
choice words 20A-20E from the vocabulary memory 50; column 13, line 18. 
Alternatively, the word recognition system 10 can select the choice word 20 with the 
highest probability signal 28 or rank signal 30 to substitute for the current word 38 
currently being recognized; column 17, line 24.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to incorporate the word substitutions of Baker with the String 
candidate generation method of Chen in order to allow for substitutions of words that 
may have been misinterpreted in the transcription process.

21. Consider claim 17, Baker teaches a method according to claim 16, further
including generating additional alternate transcripts by evaluating each record in the list 
of close call records for a match between a partial sub-history of each alternate 
utterance in the choice list and one of the histories stored in the record, and upon 
finding a match, substituting the other of the histories stored in the record for the partial 
sub-history in the alternate transcript; and, storing the additional alternate transcripts in 
the choice list (Alternatively, the word recognition system 10 can select the choice word 
20 with the highest probability signal 28 or rank signal 30 to substitute for the current
word 38 currently being recognized; column 17, line 24. As this analysis is done word
by word, a choice list is generated for each word considered, so this process is 
repeated until all words in the utterance have been considered.) 

22. Consider claim 26, Chen teaches a method of creating an alternate utterance 
hypothesis from a complete utterance hypothesis (method for generating candidate 
word strings, abstract), comprising: 

for a first partial hypothesis having an associated first score and a second partial 
hypothesis having an associated second score being less than the first score (Figure 1,
nodes N2 and N6 represent two words, N6 has lower score of -170 than N2 of -164.), 
both ending at a common time (nodes N2 and N6 both end at frame 64) and both 
seeding a common continuation word (both N2 and N6 seed word N3.), storing 
information characterizing the first partial hypothesis and the second partial hypothesis 
at each frame following the seeding of the common continuation word, the information 
including at least a history of the first partial hypothesis and a history of the second 
partial hypothesis (Figure 1, this lattice is stored, and it shows words leading to N2 and
N6 and also the associated scores, which are a measure of how close they are. The 
lattice also shows the node leading up to nodes N2 and N6, being the history.). 

But Chen does not specifically teach:

comparing a set of first words from the first hypothesis and a set of first words
from the complete utterance hypothesis; and,

if a set of first words from the history of the first partial hypothesis matches a set
of first words from the complete utterance hypothesis, substituting the history of the 
second partial hypothesis for the history of the first partial hypothesis within the 
complete utterance hypothesis. 

In the same field of word hypotheses, Baker teaches comparing a set of first
words from the first hypothesis and a set of first words from the complete utterance
hypothesis (The current word 38 being recognized can be transferred via the bus
interface 48 to the processing unit 48. The processing unit 48 can, in one example, 
analyze the known information about the current word 38 to select choice words 20A- 
20E from the vocabulary memory 50; column 13, line 18.); and, 

if a set of first words from the history of the first partial hypothesis matches a set
of first words from the complete utterance hypothesis, substituting the history of the 
second partial hypothesis for the history of the first partial hypothesis within the 
complete utterance hypothesis (Alternatively, the word recognition system 10 can select 
the choice word 20 with the highest probability signal 28 or rank signal 30 to substitute 
for the current word 38 currently being recognized; column 17, line 24. The specification 
of the application does not clearly define what a history is, and it is therefore open for
interpretation. In the previous rejection, history was interpreted as the part of the word
which had already been determined by the system of Baker. As described in column 13,
for instance, when Baker detects a current word having letter string "ORSE", choice list
generator matches this with other words such as morse, horse, etc. Here each of these
matches are choice records and the history of the record is "orse." With this
interpretation, it is clear that Baker does in fact teach histories associated with records). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to incorporate the word substitutions of Baker with the String 
candidate generation method of Chen in order to allow for substitutions of words that 
may have been misinterpreted in the transcription process. 

23. Consider claim 28, Baker teaches a method according to claim 26, further 
including generating the first score and the second score based at least upon input 
acoustic data and a set of language models (Continuing with the above example of an 
current word 38 having the letter string "ORSE," the choice list generator 14, can match 
the identified string with stored vocabulary words. The word 38 could, for example, be 
associated with any of the vocabulary words gorse, horse, norse, morse, or worse,
stored in the vocabulary memory 50; column 13, line 22. In one embodiment the choice 
list generator can employ a uni-gram model to select choice words 20 as a function of 
their rate of occurrence in the English language; column 13, line 33.). 
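For illustration only, selecting choice words as a function of their rate of occurrence can be sketched in Python as a sort over unigram counts. The function name `rank_by_unigram` and the counts shown are invented for this sketch and do not come from Baker or from any real corpus.

```python
def rank_by_unigram(candidates, unigram_counts):
    """Order candidate words from most to least frequent (unknown words count as 0)."""
    return sorted(candidates, key=lambda w: unigram_counts.get(w, 0), reverse=True)


# Invented counts, for demonstration only.
counts = {"horse": 120, "worse": 95, "morse": 8, "norse": 5, "gorse": 1}
ranked = rank_by_unigram(["gorse", "horse", "norse", "morse", "worse"], counts)
print(ranked)
```

The first element of the ranked list would then be the candidate offered first, or substituted automatically, for the current word.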

24. Consider claim 29, Chen teaches a method according to claim 26, further 
including comparing the set of first words of the history of the first partial hypothesis to 
the set of words from the complete utterance hypothesis, wherein the set of words from
the complete utterance hypothesis includes all of the words from the first partial
hypothesis (Figure 1 shows partial hypotheses ending in N2 and N6, but it is also shown
that these hypotheses include N1 and N5 respectively.).

25. Consider claim 30, Chen teaches a computer readable medium including stored
instructions adapted for execution on a processor (a computer readable medium is 
inherent to enable the method), comprising: 

instructions for storing the one or more alternate transcripts in a choice list (Then, 
the node sets with relative high string scores are selected to connect the nodes by their 
starting time frame and ending time frame, thereby generating the candidate word 
strings, abstract. Multiple strings are generated, see table 1.).

and instructions for using partial hypotheses seeding a common word (Figure 1
shows nodes n2 n6 and n9 all seeding to either n3 or n7. Nodes represent words in 
this figure.) 

But Chen does not specifically teach: 

instructions for generating a list of close call records, wherein each record
includes history information and scoring information associated with a particular pair of
partial hypotheses;

instructions for generating one or more alternate transcripts from the list of close
call records by evaluating each record in the list for a match between a partial
sub-history of the recognized utterance and one of the histories stored in the record, and
upon finding such a match, substituting the other of the histories stored in the record for
the partial sub-history in the recognized transcript.

In the same field of word-by-word recognition, Baker teaches:
instructions for generating a list of close call records, wherein each record 
includes history information and scoring information associated with a particular pair of 
partial hypotheses (The choice list generator 14 couples to the data string memory 12 
and generates a plurality of choice words 20. The system 10 offers the list of choice 
words 20, as possible substitutes for the current word 38 being analyzed by the system 
10; column 9 line 10; Baker.); 

instructions for generating one or more alternate transcripts from the list of close
call records by evaluating each record in the list for a match between a partial
sub-history of the recognized utterance and one of the histories stored in the record, and
upon finding such a match, substituting the other of the histories stored in the record for
the partial sub-history in the recognized transcript (The current word 38 being
recognized can be transferred via the bus interface 48 to the processing unit 48. The 
processing unit 48 can, in one example, analyze the known information about the 
current word 38 to select choice words 20A-20E from the vocabulary memory 50; 
column 13, line 18. Alternatively, the word recognition system 10 can select the choice 
word 20 with the highest probability signal 28 or rank signal 30 to substitute for the 
current word 38 currently being recognized; column 17, line 24. The specification of the 
application does not clearly define what a history is, and it is therefore open for 
interpretation. In the previous rejection, history was interpreted as the part of the word 
which had already been determined by the system of Baker. As described in column 13,
for instance, when Baker detects a current word having letter string "ORSE", choice list
generator matches this with other words such as morse, horse, etc. Here each of these
matches are choice records and the history of the record is "orse." With this 
interpretation, it is clear that Baker does in fact teach histories associated with records). 

Therefore it would have been obvious to one of ordinary skill in the art at the time
the invention was made to incorporate the word substitutions of Baker with the String
candidate generation method of Chen in order to allow for substitutions of words that
may have been misinterpreted in the transcription process.

26. Consider claim 33, Chen teaches a computer readable medium including stored
instructions adapted for execution on a processor (a computer readable medium is 
inherent to enable the method), comprising: 

instructions for generating an alternative version of the transcript (Then, the node
sets with relative high string scores are selected to connect the nodes by their starting 
time frame and ending time frame, thereby generating the candidate word strings, 
abstract.); and, 

instructions for adding the alternative version of the transcript to the choice list 
(Then, the node sets with relative high string scores are selected to connect the nodes 
by their starting time frame and ending time frame, thereby generating the candidate 
word strings, abstract. Multiple strings are generated, see table 1.). 

But Chen does not specifically teach: 

instructions for during speech recognition, generating a list of close call records, 
wherein each record includes histories for each of two competing partial hypotheses;

instructions for initializing the close call list from at least one output of the speech
recognition system; 

instructions for selecting one of the close call records from the list of close call 
records; 

instructions for selecting a transcript from the choice list; 

instructions for determining whether one of the two histories for the selected 
record matches a partial subhistory of the transcript from the choice list; 

if one of the two histories for the selected close call record matches a partial 
subhistory of the transcript, substituting the other of the two histories for the partial 
subhistory of the transcript to generate an alternative string candidate. 

In the same field of word-by-word speech recognition, Baker teaches: 

instructions for during speech recognition, generating a list of close call records, 
wherein each record includes histories for each of two competing partial hypotheses 
(The choice list generator 14 couples to the data string memory 12 and generates a 
plurality of choice words 20. The system 10 offers the list of choice words 20, as 
possible substitutes for the current word 38 being analyzed by the system 10; column 9 
line 10. The specification of the application does not clearly define what a history is, and 
it is therefore open for interpretation. In the previous rejection, history was interpreted
as the part of the word which had already been determined by the system of Baker. As
described in column 13, for instance, when Baker detects a current word having letter
string "ORSE", choice list generator matches this with other words such as morse,
horse, etc. Here each of these matches are choice records and the history of the record
is "orse." With this interpretation, it is clear that Baker does in fact teach histories
associated with records); 

instructions for initializing the close call list from at least one output of the speech 
recognition system (The choice list generator 14 couples to the data string memory 12 
and generates a plurality of choice words 20. The system 10 offers the list of choice 
words 20, as possible substitutes for the current word 38 being analyzed by the system 
10; column 9 line 10. The list must be initialized when considering a word in the string.);

instructions for selecting one of the close call records from the list of close call 
records (A record from the choice word list of Baker must inherently be selected in order 
to substitute.); 

instructions for selecting a transcript from the choice list (an utterance must
inherently be selected in order to replace words in it); 

instructions for determining whether one of the two histories for the selected 
record matches a partial subhistory of the transcript from the choice list (The current 
word 38 being recognized can be transferred via the bus interface 48 to the processing 
unit 48. The processing unit 48 can, in one example, analyze the known information 
about the current word 38 to select choice words 20A-20E from the vocabulary memory 
50; column 13, line 18.); 

instructions for if one of the two histories for the selected close call record 
matches a partial subhistory of the transcript, substituting the other of the two histories 
for the partial subhistory of the transcript to generate an alternative string candidate 
(Alternatively, the word recognition system 10 can select the choice word 20 with the
highest probability signal 28 or rank signal 30 to substitute for the current word 38
currently being recognized; column 17, line 24.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
the invention was made to incorporate the word substitutions of Baker with the String 
candidate generation method of Chen in order to allow for substitutions of words that 
may have been misinterpreted in the transcription process. 
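
For illustration only, the matching and substitution limitations recited in claim 33 can be sketched as follows. This is a minimal sketch and not the method of the application, Chen, or Baker: it assumes a history is a contiguous sequence of words, and the names `find_subhistory` and `substitute` are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

# Assumption: a history is a tuple of words, and a close call record holds
# the histories of the two competing partial hypotheses.
History = Tuple[str, ...]
CloseCallRecord = Tuple[History, History]


def find_subhistory(transcript: Sequence[str], history: History) -> Optional[int]:
    """Return the first index where `history` occurs contiguously, else None."""
    n = len(history)
    if n == 0:
        return None
    for start in range(len(transcript) - n + 1):
        if tuple(transcript[start:start + n]) == history:
            return start
    return None


def substitute(transcript: Sequence[str], record: CloseCallRecord) -> Optional[List[str]]:
    """If one history matches part of the transcript, swap in the other history."""
    first, second = record
    for matched, other in ((first, second), (second, first)):
        start = find_subhistory(transcript, matched)
        if start is not None:
            tail = transcript[start + len(matched):]
            return list(transcript[:start]) + list(other) + list(tail)
    return None  # neither history matches, so no alternative is generated
```

Under these assumptions, a record pairing "horse" with "morse" turns a transcript containing "horse" into an alternative string candidate containing "morse", and the reverse.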

27. Claims 10 and 18 are rejected under 35 U.S.C. 103(a) as being unpatentable over
Chen in view of Baker as applied to claim 1 above, and further in view of Olsen et al.
(US Patent 6,754,625).

28. Consider claim 10, Chen in view of Baker teaches the method of claim 1, but
does not specifically teach further including limiting the list of close call records to a 
preset maximum number of close call records. 

In the same field of determining alternative words, Olsen teaches limiting the list
of close call records to a preset maximum number of close call records (Figure 4, step 
416, restrict number of words added to list based on maximum number of words in list.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the limits of Olsen with the speech recognition of Chen and 
Baker in order to provide a method of preventing overflow of limited memory resources.

29. Consider claim 18, Chen in view of Baker teaches a method according to claim
16, but does not specifically teach further including limiting the list of close call records 
to a preset maximum number of records. 

In the same field of determining alternative words, Olsen teaches limiting the list 
of close call records to a preset maximum number of close call records (Figure 4, step 
416, restrict number of words added to list based on maximum number of words in list.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the limits of Olsen with the speech recognition of Chen and 
Baker in order to provide a method of preventing overflow of limited memory resources. 
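
For illustration only, limiting a list to a preset maximum number of records can be sketched as follows. This is a hedged sketch and not Olsen's step 416: the function name `add_record`, the `(score, label)` record layout, and the choice to evict the record with the largest score are all assumptions.

```python
from typing import List, Tuple

# Assumption: each record is a (score, label) pair, and a lower score marks
# a record that is more worth keeping.
Record = Tuple[float, str]


def add_record(records: List[Record], score: float, label: str, max_records: int) -> None:
    """Append a record, then evict the worst one if the preset maximum is exceeded."""
    records.append((score, label))
    if len(records) > max_records:
        worst = max(records, key=lambda r: r[0])  # largest score is evicted
        records.remove(worst)
```

Because the list never grows beyond `max_records` entries, memory use stays bounded regardless of how many candidates are offered.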

30. Claims 31 and 32 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Chen in view of Baker as applied to claim 30 above, and further in view of Shon
(US Patent 6,418,328). 

31. Consider claim 31, Chen and Baker teach the computer readable medium of
claim 30, but do not specifically teach wherein the medium is disposed within a
mobile telephone apparatus and operates in conjunction with a user interface.

In the same field of speech recognition Shon teaches using a computer readable 
medium for speech recognition disposed within a mobile telephone apparatus and 
operates in conjunction with a user interface (A voice dialing method in a mobile
telephone terminal. Upon reception of the dialing utterance, it is determined whether
there are one or more pre-registered dialing utterances similar to an input dialing
utterance within a first similarity value. If there are more than one dialing utterances
within the first similarity value, it is determined whether there is a pre-registered dialing 
utterance similar to the input dialing utterance within a second similarity value higher 
than the first similarity value. If there is no dialing utterance within the second similarity 
value, names represented by the dialing utterances within the first similarity value are 
displayed. If a user selects one of the displayed names, a registered telephone number 
corresponding to the selected name is dialed, abstract). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the speech recognition of Baker and Chen with the telephone
of Shon in order to provide the phone with a more robust method of speech recognition 
for voice dialing. 
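
For illustration only, the two-threshold decision described in the cited Shon abstract can be sketched as follows. This is a hedged sketch and not Shon's implementation: it assumes that "within a similarity value" means a similarity score at or above a threshold, and the function name `resolve_dialing`, the returned action labels, and the handling of the single-candidate and strong-match cases are assumptions, since the abstract does not state them.

```python
from typing import Dict, List, Tuple


def resolve_dialing(similarities: Dict[str, float], first: float, second: float) -> Tuple[str, List[str]]:
    """Decide whether to dial directly or display candidate names to the user."""
    # Registered names whose utterances are within the first similarity value.
    candidates = [name for name, score in similarities.items() if score >= first]
    if not candidates:
        return ("no_match", [])
    if len(candidates) == 1:
        return ("dial", candidates)  # assumption: a lone candidate is dialed
    # More than one candidate: apply the second, higher similarity value.
    strong = [name for name in candidates if similarities[name] >= second]
    if strong:
        return ("dial", [max(strong, key=similarities.get)])  # assumption
    # No candidate meets the second value: display the names for selection.
    return ("display", candidates)
```

The "display" branch corresponds to the user interface step in the abstract, where the user selects one of the displayed names and the registered number is dialed.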

32. Consider claim 32, Chen and Baker teach the computer readable medium of
claim 30, but do not specifically teach wherein the medium is disposed within a
handheld electronic apparatus and operates in conjunction with a user interface.

In the same field of speech recognition Shon teaches using a computer readable 
medium for speech recognition disposed within a mobile telephone apparatus and 
operates in conjunction with a user interface (A voice dialing method in a mobile
telephone terminal. Upon reception of the dialing utterance, it is determined whether
there are one or more pre-registered dialing utterances similar to an input dialing
utterance within a first similarity value. If there are more than one dialing utterances
within the first similarity value, it is determined whether there is a pre-registered dialing
utterance similar to the input dialing utterance within a second similarity value higher
than the first similarity value. If there is no dialing utterance within the second similarity 
value, names represented by the dialing utterances within the first similarity value are 
displayed. If a user selects one of the displayed names, a registered telephone number 
corresponding to the selected name is dialed, abstract). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the speech recognition of Baker and Chen with the telephone
of Shon in order to provide the phone with a more robust method of speech recognition 
for voice dialing. 

Allowable Subject Matter 

33. Claims 7-9, 11 and 12 would be allowable if rewritten to overcome the
rejection(s) under 35 U.S.C. 112, 2nd paragraph, set forth in this Office action and to
include all of the limitations of the base claim and any intervening claims.

34. Consider claim 7, Chen in view of Baker suggests the method of claim 1, but
does not fairly suggest further including generating a list of close call records wherein 
each of the close call records includes a close call score difference between the 
competing hypotheses, the score difference being used to construct the choice list, nor 
can the prior art of record be combined to fairly duplicate these limitations. Therefore 
claim 7 has allowable subject matter when combined with the limitations of claim 1.

35. Claims 8 and 9 contain allowable subject matter as they are dependent on
claim 7.

36. Consider claim 11, Chen in view of Baker further in view of Olsen suggests the
method of claim 10, but does not fairly suggest further including with each of the close
call records a close call score difference between the competitor hypothesis and the
score of the globally best hypothesis at the time the close call record is added, the close
call score difference being used to determine which close calls to keep if the preset
number of close call records is reached, nor can the prior art of record be combined to
fairly duplicate these limitations. Therefore claim 11 has allowable subject matter when
combined with the limitations of claim 10.

37. Consider claim 12, Chen in view of Baker suggests the method of claim 1, but
does not fairly suggest further including with each of the close call records, a first score 
and a second score, the first score being a close call score difference between the 
competing hypotheses, the second score being a global score difference between the 
competitor hypothesis and the score of a globally best hypothesis at the time the record 
is added, wherein the close call difference is used to construct the choice list, and the 
global score difference is used to determine which close calls to keep if the preset 
number of close call records is reached, nor can the prior art of record be combined to
fairly duplicate these limitations. Therefore claim 12 has allowable subject matter when
combined with the limitations of claim 1.

38. Claims 19-25 and 27 are allowed.

39. Consider claim 19, Chen in view of Baker teaches a method according to claim
16, further including storing in the close call list for each pair of partial hypotheses
seeding a common word (i) a history of a first partial hypothesis, (ii) a history of a
second partial hypothesis (figure 1 of Chen shows nodes N2 and N6 with history nodes 
behind them), but does not fairly suggest (iii) a score difference being a difference 
between a score of the first partial hypothesis and a score of the second partial 
hypothesis, and (iv) a global score nor can the prior art of record be combined to fairly 
duplicate these limitations. Therefore claim 19 has allowable subject matter when 
combined with the limitations of claim 16. 

40. Consider claim 20, Chen in view of Baker teaches a method of constructing a list of
alternate transcripts from a recognized transcript, comprising: 

providing two or more partial hypotheses of an acoustic transcript; 

for each pair of partial hypotheses characterized by a first partial hypothesis 
having an associated first score and a second partial hypothesis having an associated 
second score being less than the first score, both ending at a common time and both 
seeding a common continuation word, evaluating the first partial hypothesis and the
second partial hypothesis at each acoustic frame following the seeding of the common
continuation word, and storing in a close call list a record of the first and second partial 
hypotheses, the record corresponding to the acoustic frame resulting in a smallest score 
difference between a current best overall scoring hypothesis and the second score, 
wherein the record includes at least (i) a history of the first partial hypothesis, (ii) a 
history of the second partial hypothesis and, 

generating one or more alternate hypotheses by combining information from at 
least one record in the close call list with the recognized transcript (see rejection of 
claim 16), 

But Chen in view of Baker does not fairly suggest: 

(iii) a score difference being a difference between the first score and the second 
score, and (iv) a global score difference being a difference between the current best 
overall scoring hypothesis and the second score; nor can the prior art of record be 
combined to fairly duplicate these limitations. Therefore claim 20 has allowable subject
matter.

41. Consider claim 21, Chen in view of Baker teaches a method of constructing a list
of alternate utterance hypotheses from a complete utterance hypothesis, comprising: 

providing two or more partial hypotheses of an acoustic utterance; 

for each pair of partial hypotheses characterized by a first partial hypothesis 
having an associated first score and a second partial hypothesis having an associated 
second score being less than the first score, both ending at a common time and both
seeding a common continuation word, evaluating the first partial hypothesis and the
second partial hypothesis at each acoustic frame following the seeding of the common 
continuation word, and storing in a close call list a record of the first and second partial 
hypotheses, the record corresponding to the acoustic frame resulting in a smallest score 
difference between a current best overall scoring hypothesis and the second score,
wherein the record includes at least (i) a history of the first partial hypothesis, (ii) a 
history of the second partial hypothesis (see claim 16 rejection).

But Baker and Chen do not fairly suggest:

(iii) a score difference being a difference between the first score and the second 
score, and (iv) a global score difference being a difference between the current best
overall scoring hypothesis and the second score; 

for each acoustic frame, updating the two or more partial hypotheses until the 
acoustic utterance ends, and selecting a best scoring complete hypothesis; 

evaluating the records in the close call list for potential alternate utterance 
hypotheses, beginning with a record in the close call list having a smallest score 
difference and subsequently with each record in the close call list in an order of 
ascending score difference, by: 

(i) comparing a set of first words from the first hypothesis and a set of first 
words from one or more complete hypotheses from a choice list; 

(ii) if a set of first words from a history of the first partial hypothesis 
matches a set of first words from one or more complete hypotheses from the 
choice list, substituting the history of the second partial hypothesis for the history
of the first partial hypothesis within the one or more complete hypotheses from
the choice list so as to generate one or more alternate utterance hypotheses, and 
placing the alternate hypotheses in the choice list; and, 

(iii) continuing evaluating the records in the close call list until filling the 
choice list. The prior art of record cannot be combined to duplicate these 
limitations, therefore claim 21 is allowed. 
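
For illustration only, the evaluation loop recited in claim 21 can be sketched as follows. This is a hedged sketch and not the applicant's implementation: it assumes histories are word tuples matched against the first words of each complete hypothesis, and the names `CloseCall` and `build_choice_list`, the `capacity` parameter, and the skipping of duplicate alternates are assumptions.

```python
from typing import List, NamedTuple, Tuple

Words = Tuple[str, ...]


class CloseCall(NamedTuple):
    first_history: Words      # history of the first partial hypothesis
    second_history: Words     # history of the second partial hypothesis
    score_difference: float   # difference between the first and second scores


def build_choice_list(best: Words, close_calls: List[CloseCall], capacity: int) -> List[Words]:
    """Fill a choice list by substituting histories, smallest score difference first."""
    choices: List[Words] = [best]
    for call in sorted(close_calls, key=lambda c: c.score_difference):
        n = len(call.first_history)
        for hypothesis in list(choices):  # snapshot, since choices grows below
            if len(choices) >= capacity:
                return choices  # the choice list is filled
            if hypothesis[:n] == call.first_history:
                alternate = call.second_history + hypothesis[n:]
                if alternate not in choices:  # assumption: duplicates are skipped
                    choices.append(alternate)
    return choices
```

Sorting by `score_difference` means the closest calls, which are the most plausible alternates under this reading, enter the choice list before less likely ones.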

42. Claims 22-25 are allowed as they are dependent on claim 21.

43. Consider claim 27, Chen in view of Baker teaches a method according to claim 
26, further including storing in the close call list for each pair of partial hypotheses 
seeding a common word (i) a history of a first partial hypothesis, (ii) a history of a 
second partial hypothesis (figure 1 of Chen shows nodes N2 and N6 with history nodes 
behind them), but does not fairly suggest (iii) a score difference being a difference 
between a score of the first partial hypothesis and a score of the second partial 
hypothesis, and (iv) a global score nor can the prior art of record be combined to fairly 
duplicate these limitations. Therefore claim 27 has allowable subject matter when 
combined with the limitations of claim 26. 

Conclusion 

44. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time
policy as set forth in 37 CFR 1.136(a).

A shortened statutory period for reply to this final action is set to expire THREE
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Douglas C. Godbold whose telephone number is (571) 
270-1451. The examiner can normally be reached Monday-Thursday 7:00am-4:30pm
and Friday 7:00am-3:30pm.

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on (571) 272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300.

Information regarding the status of an application may be obtained from the
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.